U-Compare: share and compare text mining tools with UIMA

نویسندگان

  • Yoshinobu Kano
  • William A. Baumgartner
  • Luke McCrohon
  • Sophia Ananiadou
  • K. Bretonnel Cohen
  • Lawrence Hunter
  • Jun'ichi Tsujii
چکیده

SUMMARY Due to the increasing number of text mining resources (tools and corpora) available to biologists, interoperability issues between these resources are becoming significant obstacles to using them effectively. UIMA, the Unstructured Information Management Architecture, is an open framework designed to aid in the construction of more interoperable tools. U-Compare is built on top of the UIMA framework, and provides both a concrete framework for out-of-the-box text mining and a sophisticated evaluation platform allowing users to run specific tools on any target text, generating both detailed statistics and instance-based visualizations of outputs. U-Compare is a joint project, providing the world's largest, and still growing, collection of UIMA-compatible resources. These resources, originally developed by different groups for a variety of domains, include many famous tools and corpora. U-Compare can be launched straight from the web, without needing to be manually installed. All U-Compare components are provided ready-to-use and can be combined easily via a drag-and-drop interface without any programming. External UIMA components can also simply be mixed with U-Compare components, without distinguishing between locally and remotely deployed resources. AVAILABILITY http://u-compare.org/

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing Multilingual Text Mining Workflows in UIMA and U-Compare

We present a generic, language-independent method for the construction of multilingual text mining workflows. The proposed mechanism is implemented as an extension of U-Compare, a platform built on top of the Unstructured Information Management Architecture (UIMA) that allows the construction, comparison and evaluation of interoperable text mining workflows. UIMA was previously supporting stric...

متن کامل

Integrating Annotation Tools into UIMA for Interoperability

In this paper, we discuss the issue of implementing the interoperability of natural language annotation tools for text mining with the Unstructured Information Management Architecture (UIMA) (Ferrucci and Lally, 2004; http://incubator.apache.org/uima). In particular, we discuss the practical issue of designing UIMA annotation schemes for text mining applications based on our experience in the E...

متن کامل

Deploying and sharing U-Compare workflows as web services

BACKGROUND U-Compare is a text mining platform that allows the construction, evaluation and comparison of text mining workflows. U-Compare contains a large library of components that are tuned to the biomedical domain. Users can rapidly develop biomedical text mining workflows by mixing and matching U-Compare's components. Workflows developed using U-Compare can be exported and sent to other us...

متن کامل

Extending an interoperable platform to facilitate the creation of multilingual and multimodal NLP applications

U-Compare is a UIMA-based workflow construction platform for building natural language processing (NLP) applications from heterogeneous language resources (LRs), without the need for programming skills. U-Compare has been adopted within the context of the METANET Network of Excellence, and over 40 LRs that process 15 European languages have been added to the U-Compare component library. In line...

متن کامل

A UIMA wrapper for the NCBO annotator

SUMMARY The Unstructured Information Management Architecture (UIMA) framework and web services are emerging as useful tools for integrating biomedical text mining tools. This note describes our work, which wraps the National Center for Biomedical Ontology (NCBO) Annotator-an ontology-based annotation service-to make it available as a component in UIMA workflows. AVAILABILITY This wrapper is f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 25  شماره 

صفحات  -

تاریخ انتشار 2009